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Abstract: In this paper wc consider Markov chains associated with the 
Metropolis-Hastings algorithm. We propose conditions under which the se- 
quence of the successive densities of such a chain converges to the target 
density according to the total variation distance for any choice of the initial 
density. In particular we prove that the positiveness of the target and the 
proposal densities is enough for the chain to converge. 
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1. Introduction and the main result 

The Metropolis-Hastings algorithm invented by Nicholas Metropolis at al. [10] 
and W. Keith Hastings [4] is one of the best recognized techniques in the statis- 
tical applications (see e.g. [2. 3, 6, 7, 14, 15, 16, 17, 20]). Throughout this paper 
we shall assume that the following condition arc valid. 

%: Let (X,^, A) be some measure space with a cr-finite measure A. Assume 
we are given a target probability distribution on (X, A) which is absolutely 
continuous with respect to A with density 7r(-) : X — > M-f for which 7r(x) > 
for all a; G X. Assume also wc are given an absolutely continuous with respect 
to A proposal distribution on (X,^) which density q{-\x) : X R_|- is set 
conditionally to a; G X. It is assumed that : X x X M.^ is jointly ^ x ^ 
measurable (see e.g. [5]). □ 

The Metropolis-Hastings algorithm consists of the following steps. Generate 
first initial draw a;(o). Let we know the current draw a;(„_i). To obtain the 
next draw x^n) one should generate a candidate a;* ~ q{x\x(n-i)) and accept 
the candidate with a probability a(a;(„_i) , x*) taking a;(„) = or reject the 
candidate with a probability 1 — Q;(a:(„_i), a;,) and take a;(„) = a;(ri-i) where 

I '\ ■ (^ 7r(xO q{x\x') \ , r n 

a(a;,a; j — mm 1, — — — tor 7r(a;)q(x a;) > 

\ ■n{x) q{x'\x) J 



1 
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and a(a;,a;') = 1 for T:{x)q{x'\x) = 0. All draws are taken from X. This scheme 
defines a transition kernel 

k{x — > x') = a{x^x')q{x'\x) + 6{x ~ x') j [1 — a{x, z)) q{z\x)\{dz). (1.1) 

where 5{-) is the delta function. The integral sign stands for the A integration 
over X (including where it is necessary the delta function rule). The notation 
K,{x — ^ x') stands for a function of two variables {x,x') G X x X associated (by 
analogy to the discrete state space) with the conditional probability to move 
from state x to state x' . According to the assumptions for 7r(-) and q{-\ ) the 
kernel (1.1) is nonncgativc function. This kernel fulfills the normalizing condition 
J k{x x')X{dx') = 1 but first of all it satisfies the detailed balance condition 
(reversibility of the chain) 

tt{x)k{x x') ~ tt{x')k{x' — ^ x) (1.2) 

which has to be verified only for x ^ x' . Actually in this case we have 

■k{x)k.{x — > x') ~ tt(x')k{x' — > x) = min {7r{x)q{x'\x), Tr{x')q{x\x')) . 

From the detailed balance condition it follows that the target density is invari- 
ant for the kernel, i.e. it holds 7r(x') = / Tr{x)K{x — >■ x')X{dx). The transition 
kernel (1.1) defines a Metropolis-Hastings Markov chain of X- valued random 
variables {X(^n)) according to the following rule. Define the initial random vari- 
able Ar(o) with some proper density /(o)(') : X R+. For any next random 
variable ^(„) the corresponding density /(„)(•) is defined by the recurrent for- 
mula 

I(n){x') ^ j f(n_i)[x)n{x ~¥ x')\{dx),n = 1,2,.... 

All the probability densities in this paper are considered with respect to the 
common reference measure A. 

One of the main problems arise here is to establish conditions under which 
the sequence (/(«)(•)) converges to the invariant density 7r(-). In the general 
case of stationary Markov chain usually proves that this sequence converges 
with respect to the total variation distance dTv- i-C that 

lim dTy(Ai[./(«)],MN) = 1™ \ I |/(„)(.t) - 7r(a;)|A((ia;) = (1.3) 

n— >-oo n— >oo Z J 

under the concepts of irreducibility, aperiodicity and reversibility (see e.g. [1, 2, 
5, 9, 11, 12, 15, 17]). Here by we denote the probability measure associated 
with the density /(•). 

In this paper (Theorem 3.1) we propose conditions under which (1.3) holds 
but we follow a somewhat different approach by means of the properly defined 
Hilbert space described for example in Stroock [21]. 

Concisely formulated our main practical result (Proposition 3.4) states that if 
both the target and the proposal densities are positive functions then (1.3) holds 
regardless from the shape of the initial density. Remember that we use no other 
constructive conditions besides the mentioned positiveness. In the theorems be- 
low we use essentially only the notion of the reversibility and positiveness. 
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2. The -L^(7r) structure 

Following Stroock [21] we shall consider the Hilbert space L?{t^) with an inner 
product 



{f.9)7r^J f{x)g{x)'n{x)\{dx). 
The space L'^{tt) consists of the measurable functions /(•) : X — > M for which 



\2,TT 



\fix)\M^)X{dx) < 
(see also e.g. [13, 18, 19]). Define the operator 
IC[f]{x) = [ n{x^ x')f{x')\{dx') 

' k{x -> x')!{x')\{dx') + (j){x)f{x) (2.1) 
which is formally conjugate to the basic transition operator of the chain 

^[/](a;') = J f{x)K{x ^ x')X{dx) 

= J f[x)k{x x')\{dx) + <j)[x'),f{x') (2.2) 
where the sub- kernel k{- •) : X x X — > R-|_ 

k(x — x') = min ( q{x'\x), ^ } q(x\x')\ 

V n^) J 

is nonnegative A x A measurable function and the function c/)(-) : X 
(f>{x) = / {1 — a{x, z)) q{z\x)X{dz) 



is measurable with < (/)(x) < 1 for x G X. Actually k{- — > •) stands for 
a transition kernel of the transition operator K, and simply is a kernel of the 
conjugate operator IC. 

Put Ki(x — >■ x') = k{x — > x') and compose formally the sequence of kernels 

K„(.T — > a;') = / K„_i(a; — >■ z)ki(z — > a:')A(dz), n = 2, 3, . . . , 



which are just the transition kernels of the transition-like operators M^" in a 
sense that 

IC"[f]{x') = J f{x)K„{x ^ x')X{dx) 

imsart-generic ver. 2012/08/31 file: L2-MHG-5.tex date: March 13, 2013 



D. Tsvetkov et al./On the convergence of the M-H Markov chains 

and the usual kernels of the operators K7^ , i.e. 

/C"[/](x) = / Kr,{x -> x')f{x')\{dx'). 



Put also ki{x — > x') = k{x — > x') and compose the sub-kernels 

K„(x — > a;') = y K„_i(a: — > z)ki(z ^> a:')A((iz), n = 2, 3, . . . . (2.3) 
One can find by induction that 

= I kn{x^x')J{x')\{dx')+ f Xnix ^ x')fix')X{dx') + rix)f{x) 



where x„(- — > •) : X x X ^ M+ is some nonnegative Ax A measurable function. 
One can verify that k„(- — > •) also satisfies the detailed balance condition and 
the Chapman-Kolmogorov equation 

Km+n{x x') = j Kr,i{x z)k„(z x')X{dz), m = 1, 2, . . . , 71 = 1, 2, . . . , 

and the same is true for the sub-kernel k„(- — > •). 

Proposition 2.1. Suppose % holds and let f G L^(7r). Then for the operator 
defined in (2.1) it holds 

1) lC[f] e i^(7r) and also 

||/C[/]|l2,. < ||/||2,x. (2.4) 

2) The operator /C : i^(7r) — ;> ^^(Tr) is self-adjoint and for its norm we have 

\\JC\\ < 1. (2.5) 

3) Suppose that there exists an integer n > 1 such that «„(•—;>•) > a.e. (A x A) 
in X X X where kn{- ^ •) is a composite sub-kernel defined in (2.3). Let also 
h G L'^(t^) be a function for which IC"[h] = h. Then there exists a constant 7 
such that h{-) ~ 7 a.e. (A) in X. 

Proof. 1) Let / G L^{tt). In the beginning we assume that /(•) is bounded, i.e. 
!/(•)! ^ a.e. (A) in X for some constant C. Put 

A{x)^ [ k{x ^ x')\f{x')\X{dx') 



and consider the identity 

|/(x')P = A^x) + 2A{x)i\f{x')\ - Aix)) + (|/(x')| - A{x)f 
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Multiply the latter with tiix — x') and integrate. Then receive 

k{x ^ x')\f{x')\'^\{dx') ^ A^{x) + j K.{x x'){\f{x')\ - A{x)f\{dx') 
beeause by construetion 

j n{x x')2A{x){\f{x')\ - A{x))\{dx') = 0. 

Therefore 

\m{^)? < Mfim' = A^x) < Jk{x^ x')\f{x')\'X{dx'). 
Multiplying the latter with Tr(x) and integrating over X we get 

Il^[/]||2,. = / |/C[/](x)|2^(x)A(dx) 

< f ( [ tt{x)k{x x')\f{x')\'^X{dx')^ X{dx) 
t:{x')k{x' ^ x)\f{x')\^X{dx')] X{dx) 



= \ 7r(a;')|/(a;')rA(d.T') = ""'^ 



i 112, TT- 

Here we firstly use the detailed balanee eondition (1.2) and secondly we use the 
Fubini's theorem which allows us to interchange the order of integration in the 
case of positive integrand. In this way we prove simultaneously the inequality 
(2.4) and the fact that /C[/] G i^(7r) for the case of bounded /(•). 

Now let /(•) be any function of 1?(t:). It is enough to show the validity of 1) 
for the modulus |/(-)l. Put |/(-)|m = niin(|/(-)|, m) where m > 1 is a positive 
integer. The function G i^(7r) is bounded therefore by the first part of the 
proof we have 

II^[|/|™]||L = 1 7:{x) (^J k{x ^ x')\f{x')UX{dx')^ X{dx) 



tt{x){ I k{x^x')\f{x')\^X{dx')+cb{x)\f{x)\,nj X{dx) 

<lll/|m|!L<ll/llL 



consequently 



^ 7r{x)gUx)Hdx) < \\f\\l^ (2.6) 

where 

gM =( I k{x-^ x')\f{x')\rnX{dx') + (j>{x)\f{x)\ 
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is an increasing sequence of positive measurable functions on X. Now we are at 
position to apply the Fatou's lemma in (2.6) and after passing to a limit for 
m — > oo conclude that 

^{x) k{x ^ x')\f{x')\X{dx') + H^)\f{x)\^ X{dx) < WfWl^ 

which proves simultaneously that /C[|/|] G L?{t^) and the inequality (2.4). 2) Let 
/ G Li^ij^) and g S i^(7r) and write by means of the Fubini's theorem and by 
the detailed balance condition 

mf],g)7, = J (^J k{x x')f{x')X{dx')J g{x)Tr{x)X{dx) 

tt{x)k{x x')f{x')X{dx')) g{x)X{dx) 



fix') i^J nix' ^ x)gix)Xidx)j 'K{x')X{dx') = {f,lC[g]).. 

which proves that the operator IC is self-adjoint. The inequality (2.5) follows 
immediately from (2.4). 3) Write the identity 

h^{x') = h^{x) + 2h{x){h{x') - h{x)) + {h{x') - h{x)f 

multiply with HLn{x — > x') and integrate. Then we get 

Kn{x x')h^{x')X{dx') ^ h^{x) + J Unix -> x'){h{x') - h{x)fX{dx') 

because 

Kn{x ^ x')2h{x){h{x') - h{x))X{dx') = 2h{x) (JC[h]{x) - h{x)) = 0. 
Multiply with ■k{x) and integrate. Then 

TT{x)Kn{x ^ x')h'^{x')X{dx') \ X{dx) = / TT{x)h'^ {x)X{dx) 



+ / Tl{x) I Kn{x X'){h{x') - h{x)fX{dx') X{dx) . (2.7) 



It is easy to see that the left-hand side in (2.7) is equal to the first addend in 
the right-hand side. Therefore 

){h{x') - h{x)fX(dx')^ X{dx) = 

which implies immediately that also 

7r(x) (^j kn{x x'){h{x') - h{x)fX{dx')^ X{dx) = 0. 

Now the inequalities 7r(-) > and k„(- — •) > give that there exists a constant 
7 such that h{-) = 7 a.e. (A) in X. □ 
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Proposition 2.2. Suppose H holds and let fC : L'^{tt) — > i^(7r) be the operator 
defined above. Then for any integer v > I the following assertions are valid. 

1) Every power K-^'^" , n = 1,2,..., is positive operator, i.e. {IC'^'^"'[h], h)^^ > 
for any h £ i^(7r). 

2) The sequence (/C^''") is decreasing, i.e. {IC^'"^+^''[h], h)^ < {JC^^'^lh], h)^ for 
any h € L^(7r), n = 1,2, . . .. 

3) All the operators (/C^^" - JO'^'-^+^^p) for n ^ 1,2, ... and p = 1,2, ... are 
also positive. 

Proof. 1) Let h E L^{t^). The operator /C is self-adjoint therefore 
(/C^-^/i],/!), = (/C'^"[/i],/C''"[/i]), >0. 

2) We have \\K.\\<1 therefore 

{K:'^'''^+'^''[h],h)^ = (/c''"+''[/i],/c'^"+'^[/i])^ 

3) The positiveness of the operator (/C^"^" - IC'^'^^+'^'^p) means that 

{{JC^"" - /C2''"+2'^P) [h], h)^ > 
for any h S L^ii^) that is equivalent to 

which follows immediately from 2). □ 

Here we are at position to prove that the operator sequence (A^^") has a strong 
limit. More precisely we are going to prove that for every / G i^(7r) there exists 
the limit 

lim /C"[/] = 

where 1 denotes the constant function which equals to one. 

Further we shall need the following condition of positiveness. 

T-Lp: Assume that there exists an integer > 1 for which kv{- — > •) > a.e. 
(A X A) in X X X. □ 

Theorem 2.1. Suppose % and Hp hold. Then for every f G L'^{'t) we have 

lim ||/C"[/]-(/,l).l||2.. = 0. (2.8) 

n— f oo 

Proof. It is not difficult to find out that for any real Hilbert space H with 
an inner product (•, •) and a norm ||u|| = -y/ {u, u), u G H, with a given linear 
bounded self-adjoint positive operator T : H ^ H it holds the inequality 

\\Tur<\\T\\(Tu,u). (2.9) 
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The proof of (2.9) will be given at the end of the paper. Choose arbitrary 
/ e L^(7r). Applying (2.9) to the positive operators (/C^'"" - JO'^^+'^'^p) for 
n = 1, 2, . . . and p = 1, 2, ... we get 

IK/c^''" - /c2''"+2'"p)[/]|||^ < ii/c^''" -/c2''"+2'^p||(/c2''"[/] -/c2''"+2'^p[/],/)^ 

from which follows that 

||^2.„[^] _ ^2.«+2.p[^]||2^^ < 2 ((/C^-' [/],/). - (/C2''"+2''P [/],/),) . (2.10) 

From Proposition 2.2 we know that the numerical sequence ((/C^''"[/], f)^)^^! is 
decreasing an bounded from below by zero therefore this sequence is convergent. 
Now from (2.10) it follows that the sequence of the powers (/C^'^"[/])^i is a 
Cauchy sequence in L'^{tt) therefore it has a limit h G L'^{tt) for which obviously 
it holds IC'^^[h] = h. From Proposition 2.1(3) (with n = 2v) we get that h{-) = 7 
a.e. (A) in X with some constant 7 because the stated positiveness of the sub- 
kernel — >■ •) in T-Cp provides that 

^2i/(a; — > x') = / K^(a: — > z)ku{z x')\{dz) > 



a.e. (Ax A) in XxX. We have /C2''"[/] ^ 7I whence (/C2'^"[/], 1)^ -> 7(1,1)^ =7 
which gives 

= (/,/C2-"[l])^ = (/c2-"[/],l)^ ^7 

therefore 7 = (./, l)7r which proves (2.8) for the subsequence of the powers 
(2i/n)^^i, i.e. that 

hm /C^-' [/] = (/,!) a. (2.11) 



From (2.11) we obtain 

lim /c2''"+™[/]= lim /C^^M-^'"!/]] 



- (/C"[/],l).l = (/,/C"[l])a = (2.12) 

which proves (2.8) for the all the power subsequences {2vn + ?7i)^i where m, 
1 < m < 2z^, is a nonzero remainder after a division by 2v. Now it is not difficult 
to see that the validity of (2.8) follows from (2.11) and (2.12). □ 



3. Convergence with respect to TV distance 

Our main purpose is to investigate the behavior of the operator sequence (/C") 
rather than the sequence (A^") where the transition operator K. is defined in (2.2) 
because it actually corresponds to the Markov chain. Note that if jii and are 
absolutely continuous probability measures (w.r.t. A) with densities /i(-) and 
/2(-) then for the total variation distance c?Ty (/ii, /i2) it holds (see e.g. [9, 17]) 

dTy(Mi,M2) = ^ j \fi{x) - f2{x)\\{dx). 
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Let be the Banach space of the measurable functions /(■) : X M for 
which 



\f{x)\\{dx) < oo 

provided with the usual norm || • ||i. Wc have 

ll/lli < II/MI2,. 
because (by Cauchy-Schwarz inequality) 



(3.1) 



||/||i= / \f{x)\\{dx) 



i:{x) 



■n{x) 



\{dx) - / v/^^ 



7r(x) 



\{dx) 



< J / 7T{x)X{dx)\ / TTix) 



fix) 



■n{x) 



From (3.1) it follows that if J/tt G L'^{ti) then / e L^. 



\{dx) = |[//7r|[2,7r. 



Proposition 3.1. Suppose % and "Hp hold. Let /(•) be a function such that 
f /t^ G -^^(tt) and put 7 = / f{x)X{dx). Then 



lim ||/C"[/]- 7^11 1 = 0. 



(3.2) 



Proof. First of all notice that (3.1) guarantees the existence of the constant 7. 
It can be shown by induction that 



/C"[/](x')=^(x')/C" 



ix'),n = l,2,..., 



whence by means of the Cauchy-Schwarz inequality we obtain 



- 7^l|i = / l^"[./](a:') - iTTix')\X{dx') 



Tr{x') 



1C 



/ 

TT 



(x) -7l(x') 



X{dx') 



< W / 7r(x') 



(x') - 7l(x') 



{x')~jlix') 
X{dx') = IC 



X{dx') 
-7I 



On the other hand by (2.8) it follows that 



lim 

n— ^00 



■ 71 



(3.3) 



(3.4) 
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because 



= / l^^{x)\{dx) - / f{x)\{dx)^^. 



Now the validity of (3.2) follows immediately from (3.3) and (3.4). 



□ 



The condition T-Lp holds for example when the proposal density is positive. 
Remember that the target density is positive by condition %. 

Proposition 3.2. Suppose H holds. Let q{-\-) > a.e. (A x A) in X x X. Let 
also f/n G i^(7r). Then (3.2) holds. 

Proof. By conditions we have ti{x) > for all x S X and q{-\-) > a.e. (A x A) 
in X X X. These inequalities guarantee that k{- — > •) > a.e. (A x A) in X x X 
therefore we can apply Proposition 3.1 (for = 1 in Hp) which completes the 
proof. □ 

Hereafter wc shall prepare for the final results. Put 



Obviously X„ C X„+i and X = U,^^iX„. For /(•) : X ^ t put /[„] {x) = f{x) 
where x G X^ and /[^j (x) ~ elsewhere. 

Proposition 3.3. Suppose T-L holds. Then the following assertions are true. 
1) Let f € L^. Then IC[f] G and 



X. 




Il^[/]||i<il/lli 



(3.5) 



consequently for any f G L^ 



and g & L 



and any n 



1,2,... it holds 



< 11/ -.gill- 



(3.6) 



2) Let f G L^ be a bounded function. Then /[„i]/7r G L'^{t:) and 

lim 11/ -/[„,] 111 -0. 



(3.7) 



3) Let f G L\ 



and put 




Then 



||7[„]7r - 77r||i < ||/ - /[™]||i. 



(3.8) 



Proof. 1) We have 
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whence (again by means of the Fubini's theorem) 

ii^[/]iii= / \m{x')\\{dx') 

< I [ I \f{x)\K{x ^ x')X{dx)] X{dx') = [ \f{x)\X{dx) = \\fh 



which proves (3.5). The vaUdity of (3.6) follows immediately from (3.5) and the 
linearity of K.. 

2) According to the assumption /(■) is bounded consequently for some constant 
C it holds |/[,„](.t)| < C for x G X,„. Then 



f[m]ix) 



7r(x) 



< Cm, X e X„ 



therefore 



7r(a:;) 



X{dx) < / n{x) \Cm\^ X{dx) < C^m^ < oo 



7r(a;) 

which proves that /[m]/7r G L'^{tt). By the definition 

ll/-/[™]lli= J |/(x)-/[,„](x)|A(dx) 

which proves (3.7) because X„i X. 
3) We have 



\f{x)\X{dx) 



ll7H7r-77i'lli = / \llni]T^{x') -jTr{x')\X{dx') 



l7[,n] - 7l / 7rix')X{dx') 



.f{x)X{dx) 



< 



1/ - .f[m]l|] 



which proves (3.8). □ 

We are ready to give more general conditions under which (3.2) holds. 

Theorem 3.1. Suppose T-L and T-Lp hold. Let f G and put J — J f{x)X{dx). 
Then 

lim ||/C"[/] -7^l|i =0. (3.9) 
Therefore if f E is a probability density function (w.r.t. X) on X then 

lim dw(/^[-€"[/]],/iM) =0. (3.10) 

n— >oo 

Proof. In the beginning of this proof we shall assume that the function / G 
is bounded. Put 



7[m] = / f[m]ix)X{dx),m ^ 1,2,. . 
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For any n = 1, 2, . . . and m = 1, 2, . . . wc can write 
||^"[/]-7^lli < ll^"[/]-/C"[/H]lli + ll'^"[/M]-7[m]'r||i + ||7M^-7'r||i (3.11) 

Choose some e > 0. By (3.7) fix an integer m > 1 such that ||/ — /[m]||i < £/3. 
Then according to (3.6) we obtain 

I|/C"[/]-/C"[/m]I1i<| (3.12) 

for any n = 1, 2, . . . and according to (3.8) we obtain 

s 

||7[m]7i--77r||i < -. (3.13) 

For such a fixed m we have from Proposition 3.3(2) that /[m]/7r G LP'{t^) there- 
fore by Proposition 3.1 we get tliat 

hm ||/C"[/[„,]]-7[™]^lli=0 
consequently we can choose an positive integer no such that 

ll^"[/[,n]]-7M^lli<| (3.14) 

for any n > uq. Replacing the incquahtics (3.12), (3.13) and (3.14) in (3.11) we 
receive that 

ll^"[/]-7^lli<| + | + |=£ 

for any n > uq which by definition proves the validity of (3.9) for the case of 
bounded f € L\. 

Choose now arbitrary / g L\ and put fm{x) = f(x) where |/(a;)| < m and 
fm{x) = elsewhere, m = 1, 2, . . .. In the same way as for (3.11) one can get 

||/C"[/] - 7^lli < l|/C"[/,„] - + 2||/,n - /111 

where 

7m = j fm{x)X{dx). 

From the first part of tlic proof we already know that for arbitrary fixed positive 
integer m it holds 

lim ||^"[/„]-7„^||i = 

n— >oo 

because f„i{-) is bounded. Now the validity of (3.9) is a consequence of the 
well-known fact that 

lim ||/,„-/||i -0. 

m— >-oo 

The validity of (3.10) follows immediately from (3.9). □ 

Theorem 3.1 allows us to enforce the Proposition 3.2. 

Proposition 3.4. Suppose H holds. Let q{-\-) > a.e (A x A) m X x X. Let 
also f eL\. Then (3.2) holds. 

Proof. Analogously to the proof of Proposition 3.2 we conclude that here we 
can apply Theorem 3.1 for v = 1 in Hp. □ 
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4. Remarks 

The convergence results according to the total variation distance described by 
Robert and Casella in [15] (see also e.g. [1, 2, 5, 9, 11, 12, 17]) use essentially 
the concepts of aperiodicity. The approach used here does not need the notion 
of aperiodicity. 

Let us pay more attention to some valuable facts that stand back in the 
proofs above. From the proof of Theorem 2.1 one can see that for any / e L'^{tt) 
the sequence (/C^"[/])5^i is a Cauchy sequence in the L'^{tt) norm. For the limit 
h £ L'^ (tt) of this sequence it holds /C^ [h] = h and from the proof of Proposition 
2.1 as for (2.7) one can see that 

J tt{x) (^J k2n{x x'){h{x') ~ h{x)f\{dx')^ X{dx) = 

for any n = 1, 2, . . .. This fact allows us to show (as in the proof of Theorem 3.1) 
that for any / £ the sequence (Ai^"[/])J^]^ is a Cauchy sequence in the 
norm. Consequently for any probability density function /(•) on X the sequence 
(/C^"[/])^i is a Cauchy sequence with respect to the total variation distance. 

Let us prove the validity of (2.9). The operator T is linear bounded self- 
adjoint and positive in the real Hilbert space H therefore it holds the Cauchy- 
like inequality 

|(Tu,w)|^ < {Tu,u){Tv,v) 
for all u G _ff and all v E H. Putting in the latter v ~ Tu we obtain 

\{Tu,Tu)\'^ = \\Tu\\^ < {Tu,u){T^u,Tu). (4.1) 

Now applying the classical Cauchy inequality and the inequality for the norm 
we get 

(tVt«) < liT^i. unroll < ||r||||T«||||r«|i = |iT||||r«f . 

Replacing the latter in (4.1) we get the inequality 

\\Tur<{Tu,u)\\T\\\\Tur 

that is equivalent to HTulP < ||T||(ru,w), i.e. (2.9). Certainly various proofs 
of (2.9) can be found in many places but we present a proof here for the sake 
of completeness taking into account the importance of this inequality in our 
construction. 

For example our approach comprises with nominal adaptation the case of 
the random sweep Gibbs sampler and the case of the random sweep Metropolis 
within Gibbs sampler. 
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